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to see how much a learner — whether a computer, a child, or a second 
language learner — must master. (MSE) 
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Overview 



This paper reports empirical work in artificial inteligence (AI) on natural 
language processing (NLP). AI has come to mean many things to many 
people. On one extreme we have engineers who are primarily interested in 
software tools with which to build computer programs. On another extreme, 
AI touches on issues in philosophy of mind and philosophy of science that 
often rouse strong emotions. 

The kind of AI that we are interested in is cognitive modelling. The goal here 
is to describe some aspect of human mental activity, say problem solving or 
language use, in computational terms. The central issues is this kind of study 
turn out to be representation and reasoning (corresponding roughly to data 
structures and control in computer prograuis). By representation we mean 
the content and organization of information used. By reasoning we mean not 
only inference in the traditional sense but generally any method for drawing 
conclusions from the data. This includes analogical reasoning, heuristics 
for educated guesses, methods for deciding between conflicting evidence, 
etc. The details of representation and reasoning are a major concern of AI 
workers and controversies rage among proponents of different schools. Some 
researchers argue that only when the representation is given in predicate 
logic and the reasoning is reducible to inference in some formal sense can 
an AI model be considered seriously. Others argue that the representation 
should be given in a network which resembles a neural network and that 
reasoning should be activation (with, perhaps, some inhibition) over the 
network. For our present purposes, these implementation detaUs are not 
important. What is important is what knowledge is needed and what is 
done with it. 

The AI-NLP Enterprise 

In this paper we will examine closely several problems in language under- 
standing involving semantic ambiguity, anaphoric reference, and metonymy. 
These examples come from our own work and the work of other AI re- 
searchers. 

Ambiguity 

Consider the following examples: 
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1(a). John smoked the pot. 

1(b). John put the pot in the dishwasher. 

1(c). John heard the police coining. He qmckly hid the pot in the dish- 
washer. 

The issue here is disambiguating pot as ^cooking pot' or as ^maxijuaua'. 
Assuming that the listener has disambiguated smoked (as ^burned in order 
to inhale' rather than 'prepared food by exposing to smoke' or other more 
informal uses), the first example seems straightforward. Only the 'mari- 
juana' sense satisfies the selectional restrictions on the appropriate meaning 
of 'smoked'. The seccnd example is more difficult. Esther meaning of *pot' 
fits the selectional restrictions and, of course, the sentence is ambiguous. 
Nonetheless, when this sentence is heard in isolation, most people prefer the 
'cooking pot' meaning. Presumably people have knowledge of the normal 
uses of a dishwasher. The third example is the most complex. It depends 
on knowing the usual role of police and the social and legal status of mari- 
juana versus cooking pots. In a society where marijuana was legal but pots 
were valuable (and therefore possibly stolen) we might disambiguate 'pot' 
differently. Examples like this riise difficult issues of meaning. The fact that 
marijuaiia is currently illegal in this society isn't clearly a "compositional" 
feature of its meaning. In this story, 'dishwasher' is also ambiguous (between 
person and machine) aud is disambiguated by 'hid ... in'. Understanding 
this story then depends on a less-obvious aspect of dishwashing machines, 
namely that they may be treated as opaque containers. 

The pcrint here is that ambiguity is common aud even in simple cases requires 
quite a bit of world knowledge to resolve. In building systems that can 
handle this kind of ambiguity, we quickly encounter problems in organizing 
the necessary knowledge aud in activating the ''right" pieces. Why is the 
illegality of marijuana relevant for the third example and not for the first 
two? Can a dishwvsher only be used on cooking pots and not on marijuana? 
How do we know that? (I.e., what kind of organization of knowledge or 
reasoning allows us to make the right conclusion?) The reasoning needed 
to choose the appropriate evidence and to choose a preferred meaning is 
complex. 
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Metonymy 

Metonymy is in general the use of one thing to refer to something related, 
as in sajring *the sword' to mean military action or in referring to a diner 
customer as *the ham sandwich\ It is often treated as a literary device but 
in fact is in conunon use even in technical contexts. In their study of ill- 
formed input, Carbonell and Hayes (1983) examine a front end to a database 
of computer equipment in which the user says: 

2. Can you connect a video disk drive to the two megabytes? 

Of course, the author doesn't want to add the drive to the count or even 
the memory being counted. The drive is to be added to the compute^ (men- 
tioned earlier in the discourse) that has the two megabytes (as compared 
to some other computer mentioned). These examples are easy for people 
kno^edgible in the area to understand but it is difficult to formulate any 
general algorithm describing how we do it. People without spediic knowl- 
edge of computers might find the discourse incomprehensible. 

Anaphoric reference 

Not only do we need appropriate representations to process the sentences 
we hear, but we also need to be able to modify the representations in the 
course of listening. Consider the following situations: 

3(a). On the shelf, I have a hat from the Super Bowl and a cup from Chicago. 
These things are my newest souvenirs. 

3(b). I found a nickel, a dime and a quarter in the drawer. The oldest of 
these was from 1S19. 

In the first example, a listener must resolve the reference of ^these tlungs\ A 
simple heuristic is to track back to the most recent set of possible referents 
that can be considered Hhings*. But no such set has been mentioned. An 
understander (whether natural or artificial) must create the set from an 
appropriate list of tHngs. In the second example, ^oldest' clearly refers to 
a set that is orderc-d by time. The understander must not only form the 
set but must know enough about the objects to realize that a time order is 
possible. 
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It is not obvious what kind of a strategy gives us the right r^^presentation. 
Given the enormous number of possible sets that we might have to create 
and the enormous number of ways to construe them, it seems unlikely that 
we store every possiUe combination of the things we know. So we must 
be able to create at least some of the knowledge structures in response 
to the conversational demands. But what process allows us to build the 
right sets? In example 3(a), the things are the hat and the cup, but not 
the shelf, so it can't be that we just form groups of things. The language 
processor should know something about souvenirs and the kinds of things 
that get chosen. But the shelf might be a souvenir from a time of interest in 
woodworking. In example 3(b), the age of the drawer isn't important, is it? 
A language underrtander also must understand when an object (or action or 
set of objects or . . . ) is in focus. Focus is a function of many things, including 
our knowledge of how language is used in interaction and knowledge about 
the social roles of objects and actions. Understanding focus seems to be to 
a great extent cultural. 

The Importance of the Computational Approach 

Much of the work we do in AI looks simple. Certainly, none of these sen- 
tences seem particularly difficult to understand. But in trying to build an 
actual artificial understander we are forced to make explicit the assump- 
tions, knowledge and processes we use. As these examples show, even sim- 
ple sentences can require a lot of world knowledge and non- trivial reasoning 
strategists. Of course, w" are interested in finding general rules for language 
understanders. What form of organization of the knowledge allows us to ac- 
cess the information we need when we need it? What heuristics do we use to 
choose the most useful features of the knowledge? What assumptions do we 
need to make about other speakers and listeners in order to conmiunicat3? 

This work is interesting because by insisting on specific mechanisms we can 
make explicit the enormous amount of work we do thoughtlessly when we 
engage in even simple reading or conversation. Because our years of expe- 
rience m thinking, reasoning, persuading, etc. allow us to process langaage 
almost automatically it is easy to forget just how complicated this process- 
ing is. It is easy and tempting to try to reduce this complexity by assuming 
a few simple, formal ndes or by assuming that what is obvious to us must 
be obvious to everybody else. It is easy to underestimate the sophistication 
of the learning and reasoning strategies of small children, or even infants. 
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This kind of thinking is possibly more dehumanizing than the technology 
we often worry so much about. By examining cognitive processing at the 
micro level we can see how much a learner, whether a computer, a child or 
a second language learner, has to mast^^r. 

Beyond the details of language understanding, the surest lesson from this 
kind of research is that each one of us is extremely complex and capable of 
ordinary feats that still stump the best computers. 
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